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NONPARAMETRIC  TEST  FOR  THE  SLOPE  OF  A 
TRUNCATED  REGRESSION 

Short  Title:  TEST  FOR  A  TRUNCATED  REGRESSION) 

By  P.  K.  Bhattacharya ,  University  of  Arizona 
ABSTRACT 

Consider  the  hypothesis  Hgte  =  Bq  >  0  in  a  linear  regression  model 
vhere  the  cdf  of  Y  -  ex  is  unknown  and  Y  is  subject  to  the  truncation 

Y  i  Yq-  Testing  Hq  on  the  basis  of  n  independent  (x^,Y^)  with  x1 

1  •••  1  xn  equivalent  to  testing  the  underlying  homogeneity  of  the  in¬ 
dependent  V.  =  Y.  -  6qX^  subject  to  progressive  truncation  V.  <  wi 
=  yQ  -  SqX.  .  For  analyzing  astronomical  observations  a  test  has  been  pro¬ 
posed  in  the  literature,  which  computes  the  sequential  ranks  R^  of  V. 
among  N.  "comparable"  V.,  j  <_  i  satisfying  V,  ^  w.  and  compares  the 

J  J  * 

empirical  cdf  Hn(t)  of  (2R^  -  1 )/2N.  with  t  by  a  K  -  S  statistic. 

Since  (2Ri  -  1)/2N^  are  neither  indeoendent  nor  exactly  uniform  [0,1], 

the  applicability  of  the  usual  asymptotic  null  distribution  of  the  K  -  S 
statistic  in  this  context  needs  justification  which  is  provided  in  this 
paper  under  a  sufficient  condition  requiring  that  the  rate  of  progressive 
truncation  is  not  too  severe. 

^This  research  was  supported  by  NSF  grants  HCS-78-01108  and  MCS-80-02774. 

Key  Words:  Truncated  Regression,  Hubble  Diagram,  Testing  Homogeneity 
Under  Progressive  Truncation,  Kolmogorov-Smirnov  Test, 
Sequential  Rank,  Asymptotic  Distribution,  Brownian  Bridge. 
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1.  Introduction 

Consider  a  linear  regression  model  in  which  Y*  -  t-x* ,  j  =  1,  2,  ... 

J  J 

are  iid  with  cdf  F.  Suppose  we  observe  (x*,Y^)  only  if  Y*  <  y_  and 

J  J  J 

let  (x*.  ,YT  )  =  (x.,Y.),  i  =  1,  2,  ...  denote  the  observable  pairs.  Then 
J  i  J  i  11 

Y,,  Y^,  ...  are  independent  and 


P[ Y i  <  y]  =  F (ni n{y ,yQ)  -  ax^/Ffyg  -  =xi). 

F  is  continuous  but  otherwise  unknown,  v^  is  a  known  constant  and  we  are 
interested  in  inference  about  the  regression  coefficient  £. 

In  the  non-truncated  case,  Theil  (1950)  proposed  a  nonparametr ic 
estimate  of  i  based  on  Kendall's  tau  and  Sen  (1963)  derived  its  asyrototi: 
nronerties.  Bhattacharya,  Yang  and  Chernoff  (1980)  developed  a  modification 
of  Theil's  estimate  to  suit  the  truncated  case.  In  an  entirely  different  ap¬ 
proach  to  the  truncated  regression  problem.  Turner  (1979)  used  a  Kolmogorov- 
Srirnov  test  based  on  sequential  ranks  for  testing  Hq : £  =  3q  s  0  in  analyzing 
astronomical  observations  where  truncated  scatter  diagrams  called  Hubble 
diagrams  are  obtained  as  plots  of  luminosity  distance  versus  redshift  of 
various  celestial  objects. 

To  describe  the  test  statistic  in  a  sample  of  size  n,  arrange 

x,,  ....  x„  as  x  ,<...<  x  „  (equal  x's  being  arranged  arbitrarily) 

I  n  ni  —  —  nn 

and  define  Y  .  =  Y.  if  x  .  =  x. .  Under  Hn,  the  residuals  V  .  =  Y  . 

ni  k  ni  k  0  ni  ni 

-  -gxn^  are  iid  observations  subject  to  progressive  truncation  Vn-  wni 
3  Y0  '  :0xni  w1th  wnl  -  ‘ - -  wnn  since  *0  ispositive'  Each  vni 


3 


is  comparable  to  only  those  Vnj,  j  <  i  for  which  Vnj  ^  wn- ,  because  the 

conditional  distribution  of  V  .  given  V  .  •-  w  .  is  the  sane  as  the  dis- 

nj  nj  -  m 

tribution  of  V  •  under  FL.  Conseouently ,  under  H. ,  the  rank  of  each 
m  0  0 

Vni  among  comparable  Vn^ ,  j  <_  i,  is  uniformly  distributed  over  all 
possibilities.  Formally,  letting 


i:Vnj  -  wni 


-  V 


=  rank  of  V  .  among  {V  . : j  £  S  .  ■  r  .  = 

m  3  ‘  nj  J  m  *  Tni 


(2,„i  - 


Turner  heuri stical ly  argues  that  under  HQ,  the  ;ni  are  asymptotically 

unif  [0,1],  so  that  their  empirical  cdf  Hn(t)  should  closely  resemble  t, 

and  D*  =  ,/n  sup  LH  (t)  -  t]  can  be  used  for  a  one-sided  test  of  Hn. 
n  n  a.  i  n  u 

0<t<l 

This  argument  can  be  justified  when  there  is  no  truncation,  because 
then  =  i  and  Rni  are  independent  with  P[Rni  =  r]  =  1/r,  1  <  r  .i, 
as  shown  by  Parent  (1965)  and  Bhattacharya  and  Frierson  (1931 ).  Under  trunca¬ 
tion,  the  validity  of  D*  as  a  test  statistic  is  not  so  obvious,  because 
the  behavior  of  and  Rni  become  more  complicated  and  the  £ni  become 

dependent.  Yet  a  Monte  Carlo  study  supported  the  fact  that  the  asymptotic 

null  distribution  of  D+  is  what  it  should  be  if  l  .  were  iid  unif  [0,1 j. 

n  m 

In  this  study,  were  taken  to  be  i/n,  =  =  1  and  were 

unif  [0,c]  +  3x  .  subject  to  the  truncation  Y  .  <  where  c  >  yn  -  1 . 

ni  m  —  u  u 

This  was  achieved  by  generating  V  ^  as  unif  [0,wn^]  with  wni  =  y^  -  i/n. 
Fifteen  cases  involving  y^  =  2.0,  1.5,  1.1  and  n  =  10,  25,  50,  100,  200 
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were  considered  with  100  runs  for  each  case  and  D  was  computed  for  each 

n  o 

+ 

run.  The  asymptotic  cdf  of  Dn  from  independent  unif  [0,1]  is  1  -  e 
having  mean  /n'/8  =  0.6267,  st.  dev.  /(4  -  *  )/8  =  0.3276  and  upper  10c; 
and  SZ  ooints  1.07298  and  1.22387.  The  relative  frequencies  of  Dn 
lying  above  the  theoretical  percentage  points  and  their  means  and  standard 
derivations  for  the  fifteen  cases  are  given  in  the  following  table.  Though 
the  Monte  Carlo  experiments  tend  to  give  rise  to  somewhat  smaller  values 
then  expected  from  the  theoretical  distribution  due  to  discreteness  of  the 
Cni ,  there  is  strong  indication  (especially  in  respect  of  the  mean)  that 
the  asymptotic  behavior  of  D*  is  not  affected  by  truncation. 

The  purpose  of  this  paper  is  to  examine  how  far  Turner's  procedure 
can  be  justified  in  the  presence  of  truncation  so  as  to  explain  the  above  Monte 
Carlo  results.  The  main  result  is  a  sufficient  condition  for  the  applicability 
of  the  usual  asymptotic  null  distribution  in  this  context  which  requires 

lim  inf  min  i_0/2+<5)  j  p(w  .)/F(w  .)  >  0 
n-~  kn<i<n  j=l  nl  nJ 

1  -1/2 
for  some  0  <  6  <  «•  and  some  sequence  (k  }  such  that  k  -*  *  and  n  k  -  0 
—  c  ^  n  n  n 

as  n  -*  co.  The  condition  means  that  the  rate  of  progressive  truncation  is  not 
too  severe. 

In  the  above  Monte  Carlo  experiments,  V  ^  were  unif  [0,wn1- ]  with 

i  /  p.  5 

wni  =  y0  '  ^n’  Lett1n9  t  =  i/ny0  and  h( t)  =  t  (1  -  t),  we  have 

,-0/2*«)  j  F(wn,)/F(wnj).  r(1/z*8,(l  -  i/ny0)  j  O  -  j/nyor' 
j“l 

>  (ny0)1/2-6t'(1/2+5)(l  -  t ) 1 og (1  -  t)_1  >  ( nyQ ) 1 /2_ 5h ( t ) . 
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1  -1/2 
Hence  for  a  •-  0  and  k  >  -  at  a  rate  slower  than  n  , 

2  n 

min  j  F{w  .  )/F(w  . )  >  {nyn)^/2~6  min  h(t) 

kno<n  J-l  "J  0  kB'ny0<t:.l/y0 

=  (ny0)1/2  <min{h(|<n/nyo)’h(1/yo) } 

=  min{kj/2_6(l  -  kn/ny0),n1/2‘6(l  -  l/yQ) } 


not  only  stays  positive  but  tends  to  +  »  as  n  -  *.  This  exolains  why 
the  effect  of  truncation  appears  to  be  negligible  in  these  experiments. 


2.  Joint  Distribution  of  <_  i  nl. 

Let  Uni  ”  F<Vn1>  and  ant  '  F("ni>'  Then  ’  i  anl  i  • ' ■  i  a„„  >  0 
and  are  independent  unif  [0,ani].  Formulas  (1)  for  Sni,  N^,  Rn-, 

are  equivalently  exoressed  by  substituting  U  .  and  a  .  for  V  . 

ni  J  J  3  m  m  m 

and  w  .  respectively.  We  now  obtain  the  distribution  of  tN  .}  and  the 
m  J  m 

conditional  distribution  of  {R„- }  given  {N  .}. 

m  m 


Theorem  1 .  For  each  n,  {N^}  is  a  Markov  chain  with 


P^n,i+l  *  =  m-’  *  (k-l»an.H1/a„i>l‘"'<'  ‘  a„,,+,/a„i 


for  1  <  k  <  m  -  1,  startinq  at  N  ,  =  1  and  for  given  (N  . R  .  are 
—  —  nt  ni  ni 

conditonally  uniformly  distributed  on  {1,  }  and  a^e  conditionally 

independent. 

Fix  n  and  suppress  n  in  the  subscripts  of  ani,  Un-,  Sni,  Nn^ 
and  R^ .  Thus  1  i  >  ...  >  an  >  0,  U^,  ...,  U  are  independent 
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uni f  [O.a^].  S..  =  {j  _<  i  :U j  ^  a.  },  N..  =  #  and  R.  is  the  rank  of  U. 

among  Uhrj  e  }.  For  N.  =  m  and  =  (v^,  •••♦  vin)  with 

v.,  <  ...  <  v.  =  i,  let  UT.  =  U  identify  the  randon  variables 
ii  im  i  j  v- ■ 

1  J 

fU . : j  £  S.  }  in  their  appropriate  order  in  each  of  the  random  sets  S^. 

For  brevity  of  notations,  denote  the  collection  and 

{Uj:j  <  i,j  &  S.j  }  by  C^.  Let  R^,  R^  denote  the  sequential  ranks 
A  ^ 

and  k^,  the  usual  ranks  of  U*..^  ,  i.e.,  R^  and 

R.^  are  the  ranks  of  among  U^,  U^.  and 

respectively.  Clearly,  R.  =  R.M  =  R.w  .  The  following  property  of  the 

i  l.^  IN. 

U*.'s  is  immediate. 

*  J 


Ci  * 


Lemma  1 .  UT^,  U*N  are  conditionally  iid  unif  [O.a^  given 


From  Lemma  1  we  draw  the  following  conclusions. 


Lemma  2.  P[R.  =  rjC-.R.p  ....  Ri>N  =  NT1  =  P[R.  =  r |Ni ] 


1  <  r  <  N.. 


Proof.  By  Lemma  1,  R^,  ...,  R^  ^  and  R^  =  R^  are  sequential 

ranks  of  random  variables  which  are  condi tional ly  iid  given  and  the  first 
equality  follows  from  a  known  property  of  sequential  ranks  of  iid  rv's 
(see  Bhattacharya  and  Frierson  (1981),  Lemma  1).  The  second  equality  is 
obvious  because  this  conditional  probability  depends  only  on  . 
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Lenina  3.  P[Ni+]  =  klC-.R.j,  ....  Ri  >N  .j.Rj] 

N.  N.-k+l 

=  {k-l)(ai+l/ai)k'1(1  “  ai+l/ai)  1  =  pCNi+1  =  klNi]  for  1  i  k  <  N.  +  1 . 

Proof.  The  order  statistics  (UT^,  ....  ll|^N  of  U^,  .... 

are  conditionally  independent  of  their  ranks  (R^,  RiN  )  and  hence  of 
their  sequential  ranks  (R. R.  ,  R.)  given  C-  since  the  two  rank 

11  I  j  11  •  “  I  »  1  1 

vectors  are  in  one-one  correspondence.  The  first  equality  now  follows  because 
Ni+i  =  k  if  and  only  if  1  ai+i  <  ^i(k)  anci  t^'e  seconc*  equality  is 

obtained  as  in  Lemma  1. 


The  next  lerma  follows  by  standard  arguments  involving  conditional 
expectations  and  we  omit  its  proof. 

Lemma  4.  Let  X,  Y,  Z,  T  be  random  variables  mapping  a  probability 
space  into  appropriate  ranges.  Suppose  T  and  Y  are  conditionally  inde¬ 
pendent  given  X,  and  Z  is  determined  by  X  and  Y.  Then  T  and  Z  are 
conditionally  independent  given  X. 


Proof  of  Theorem  1 .  The  crucial  thing  to  observe  is  that  (N^R^, 
....  (N-_i ,R._^ )  are  determined  by  C.  and  R.^,  ...,  R.  ^  .  Hence 

(Nj.Rj),  ....  (Ifj  j.R.  .j),  N.  are  determined  by  C.,  R.j,  ....  R.  N  _1 

and  (Nj.Rj),  ....  (N^.R^)  are  determined  by  C^,  R.^,  ....  R^  N_(»  Pa¬ 


using  this  fact  in  conjunction  with  Lemmas  2,  3  and  4,  we  conclude  that  for 


2  <  i  <  n. 
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P[Ri  =  r|(N1,R1), 


("i-l  *Ri - !  «  p[«i 


r  |  N  -  ] 


n:1 

i 


N. 

l 


and 


P[Ni+1  =  k|(Ni,R1),  ....  (N.,R.)]  =  P[N.+1  «  k|N.] 


N  N  -k+1 

<’ -WV1  ■ 


The  theorem  now  follows  because  (N^,R^)  is  trivially  (1,1)  and  for  k  =  1 
or  2,  P[N2  =  k|  N.j ,  R-j  ]  =  P[N2  =  k]  is  given  by  P[U  -j  >  a2]  =  1  -  a2/a1  and 
P[U1  <_  a2]  =  a2/a1  respectively. 

3.  Asymptotics 

By  Theorem  1,  Cni  =  (2Rni  -  1 )/2Nni  are  conditionally  independent 

given  n  =  (N  ,,...,  N  )  with  conditional  cdf  G  .(tin  )  increasing  by  jumps 
n  nl  nn  m'n  3 

of  1/N^  at  (2r  -  l)/2Nn.,  1  <_  r  <  Nn*.  The  question  is,  how  small  should 
the  eni(t,nn)  =  Gn^(t|n  )  -  t  be  so  that  the  normalized  empirical  cdf  Hn ( t ) 
of  the  viz.  Xn(t)  =  n-1^[Hn(t)  -  t]  will  behave  like  the  Brownian 

bridge  B*(t)  on  [0,1]?  We  discuss  weak  convergence  of  the  empirical  cdf  of 
conditionally  independent  random  variables  in  the  Appendix,  which  may  be  of 
some  interest  in  itself,  and  show  that  Xn(t)  converges  weakly  to  B*(t) 

provided  that  n~ w  (t,nn)  I  converges  uniformly  to  0  in  probability 

(Theorem  4).  In  the  present  context,  |ep^ (t,n  )  |  £  (2Nn^)"1.  Hence  a  suf- 

ficient  condition  for  the  desired  convergence  is  n" '  [N~ !  -i-,  0  as  n  -*  «. 


We  summarize  this  in  the  following  theorem. 
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Lenina  3.  P[N.+J  =  k  | C^-  ,R^,  . 


P[Ni  +  1  =  k|Ni]  for  1  i  k  i  N.  +  1 . 


Proof.  The  order  statistics  •••»  U^(N.)^  Utl*  UiN. 

are  conditionally  independent  of  their  ranks  (R.^,  •••»  R^  )  and  her.ce  of 
their  sequential  ranks  (R.,,  ....  R.  M  ,  R • )  given  C.  since  the  two  rank 

vectors  are  in  one-one  correspondence.  The  first  equality  now  follows  because 
N^+i  =  k  if  and  only  if  ^  <_  a.+^  <  and  the  second  equality  is 

obtained  as  in  Lemma  1. 


The  next  lemna  follows  by  standard  arguments  involving  conditional 
expectations  and  we  omit  its  proof. 

Lemma  4.  Let  X,  Y,  Z,  T  be  random  variables  mapoing  a  probability 
space  into  appropriate  ranges.  Suppose  T  and  Y  are  conditionally  inde- 
Dendent  given  X,  and  Z  is  determined  by  X  and  Y.  Then  T  and  Z  are 
condi tional ly  independent  given  X. 

Proof  of  Theorem  1 .  The  crucial  thing  to  observe  is  that  (N^,R^), 

....  )  are  determined  by  and  R.^,  ...,  R^  j^.-l*  Hence 

(N1  ,R] ) ,  (Nil  »Ri_1 ) »  N.  are  determined  by  C-,  R^,  ...,  R^N<_1 

and  (Nj,Rj),  ....  (N^R^)  are  determined  by  C.,  R^,  ....  Ri  N  .i»  Re¬ 
using  this  fact  in  conjunction  with  Lemmas  2,  3  and  4,  we  conclude  that  for 

2  <  i  <  n. 
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Pt^  =  ....  (N._1>Ril),Ni]  =  P[R.  =  r  1 N  i  ]  = 


N. 


and 


P[N.+]  =  k|(N.,R1 ),  ....  (N.,R.)]  =  P[N.+1  =  k|N.] 


N.  .  ,  N.-k+l 

=  <k-l^ai+l/ai>  0  *  ai+l/ai>  1  ’  1  Ik  IN.  +  1 


The  theorem  now  follows  because  ( Ni , R-j )  is  trivially  (1,1)  and  for  k  =  1 
or  2,  P[N2  =  k| N-j  ,R-j  ]  =  P[N2  =  k]  is  given  by  P[U-|  >  a,,]  =  1  -  a2/a1  and 
P[U1  <_  a^]  =  a2/a1  respectively. 

3.  Asymptotics 

By  Theorem  1,  ?ni  =  (2Rni  -  l)/2Nni  are  conditionally  independent 

given  nn  =  (Nr1  ,  Nnn)  with  conditional  cdf  Gni(tjnn)  increasing  by  jumps 

of  1/N  .  at  (2r  -  1 )/2N  ■,  1  <  r  <  N  ..  The  question  is,  how  small  should 

n  l  ;  n  l  —  —  n  i 

the  cni(t,nn)  =  Gnl.(t|nn)  -  t  be  so  that  the  normalized  empirical  cdf  Hn ( t ) 
of  the  r  ,  viz.  Xn(t)  =  n’1//2[Hn(t)  -  t]  will  behave  like  the  Brownian 
bridge  B*(t)  on  [0,1]?  We  discuss  weak  convergence  of  the  empirical  cdf  c 
conditionally  independent  random  variables  in  the  Appendix,  which  may  be  of 
some  interest  in  itself,  and  show  that  Xn(t)  converges  weakly  to  B*(t) 

provided  that  n"  '  £leni  I  converges  uniformly  to  0  in  probability 

(Theorem  4).  In  the  present  context,  | c  - ( t ,nn) |  ±  (2N^)  ^ •  Hence  a  suf- 

i  1 2  n  ^  _ 

ficient  condition  for  the  desired  convergence  is  n  TN  .  — ►  0  as  n  * 


We  summarize  this  in  the  following  theorem. 


1/2 

Theorem  2.  As  N  -  <■>,  Xn ( t )  =  n‘  [Hp(t)  ‘  0  converges  weakly 
to  the  Brownian  bridge  B*(t)  in  Skorokhod  topology  on  D[0,1]  provided 

that  kl  ■*.  0. 

We  now  derive  a  condition  in  terms  of  severity  of  the  rate  of 
progressive  truncations,  which  guarantees  the  above  convergence. 

Theorem  3.  n-1^2  Tn‘!  -2-  0  as  n  ->■  «.  Drovided  that  there  is  a 

-  4.  nl 

1/2 

sequence  kp  -»  °°  with  kpn  '  -►  0  such  that  for  some  5  >  0, 

(2)  lim  inf  min  i-1/2-5  £  a  ./a  .  >  0. 

n-«°  k  <i<n  J=1  J 


Proof.  For  arbitrary  c  >  0,  choose  n  so  large  that 


(3) 


2(1  -«)-V4/2n 


(knn 


"  knn 


-1/2 


holds  for  kp  and  6  <  1  satisfying  the  hypothesis  of  the  theorem.  Since 
Npi  >  1,  the  event  {Npi  >  i^+<^/2,kn  +  1  <  i  <  n)  implies 


n'1/2  ?N-J  <  k  n'1/2  +  n'1/2  [  i'(1^)/2 
1  n1  ~  n  k+1 


<kn'1/2  +  2(1-6) -1(n(1-6)/2-ki1-^/2)n-1/2< 


by  (3).  Hence  for  sufficiently  large  n 
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Appendix 

Consider  a  sequence  of  random  vectors  {nn)  and  a  triangular  array 
of  random  variables  Uni»l  £  i  <_  n},  n  =  1,  2,  ...  such  that  for  each  n, 
the  ;  .  are  conditionally  independent  given  nn  with  conditional  cdf's 
Gfii  ( ! nn ) *  We  obta’n  a  sufficient  condition  for  weak  convergence  of  ap¬ 
propriately  normalized  empirical  cdf's  of  {£  .},  viz. 

Xn(t)  =  n1/2[n  1  JI[(U]Uni)  -  t],  0  <  t  <  1 

to  the  Brownian  bridge  {B*(t),0  <_  t  <  1 }  in  Skorokhod  topology  on  D[0,1]. 
Write 


XJt)  =  n'1/2 


{J[0,t] 


(^ni’nn)  +  n’1/2  j-Gni 


where  J[0ft](«n1.nn>  -  “  Gni(t^n) 

independent  given  nn  with  conditional  mean  0 

Gni^slnn^{1  '  Gni^lnn^}  for  °lS£t<l, 
Suppose  the  following  condition  holds. 


,  1  <_  i  <_  n  are  conditionally 

and  conditional  covariances 


and  eni(t,nn)  =  5ni(t|n„)  -  t. 


n 


1/2  n  q 

Condition  A.  sup  n  /|*.  .(t,n  )|  —  *  0  as  n  »  ». 
t  1  n 

-1/2  ^ 

Then  n  '  J\.  . (t,n  )  converges  to  0  in  nrobability  uniformly  and  hence 

■j  n  i  n 

in  Skorokhod  topology,  so  that  it  is  enough  to  show  that 


(6) 

in  Skorokhod  topology  on  D[0,1].  The  following  theoren  says  that  we  need 
nothing  more  than  Condition  A  for  the  required  convergence. 

Theorem  4.  Under  Condition  A,  Xn(t)  — *  B*(t)  in  Skorokhod  topology 
on  D[0,1]. 

To  prove  Theorem  4,  we  need  the  following  extended  version  of 
Theorem  15.6  of  Billingsley  (1968). 

Theorem  5.  Suppose  that  the  finite-dimensional  distributions  (fdd) 
of  (Y  }  converge  to  those  of  Y  and  that  Y  is  left-continuous  at  1 
a.s.  Suppose  further  that 


(7) 


E[|Y„(t)  -  Yn(t,)lY|Yn(t2) 


v„(t) 


for  t ^  .f.  t  t g  and  n  _>  1 ,  where  y  >_  0,  a  >  ^,  and  $n  are  a.s.  non¬ 
decreasing  random  functions  (depending  on  nn)  converging  pointwise  in 
probability  to  a  continuous  (hence  uniformly  continuous)  function  *  on 
[0,1].  Then  YR  -£* 


Y. 
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Proof  of  Theorem  5.  Using  Billingsley's  notation,  we  need  to  show 
that  for  given  .  ,  •  ^  we  can  find  such  that  P[w"(Yn,‘)  >.  ]  •  ^ 

for  sufficiently  large  n.  Proceeding  as  in  Billingsley's  proof  we  arrve 
at  a  counterpart  of  his  (15.29)  with  t  in  place  of  F,  fron  which  it 
follows  that  for  s  =  (2u)  \  u  positive  integer, 

P[w"(Yn,;.)  >  .  !nn]  '  2Kt  _2>[;n(l  )  -  :n(0)  ]V^  1-1  . 

V  =  max-  max  [ :  ( (2i  +  2)-)  -  :(2if>)],  max  [ :  ( ( 2i  ♦  3)‘)  -  :  ((2i  ♦  1 
n  0  i  u- 1  "  n  (Mmj-2  n  n 

For  given  >  '  ,  let  B„  ,  -  ■  :  sup  •  ( t )  -  ;  ( t )  •  •  '  .  By  the  hypothesis  of 

rit.  n  ^  n 

the  theorem,  ;  actually  converges  uniformly  in  probability  to  so 

P[BC  ,]  ■  ,/2  for  large  n.  Now  choose  ‘  such  that  ; ( s )  - 

nt  i 

for  s  -  t  2;.  Then  for  €  B  ,,  !  .*  (s)  -  :  (t)  ,  •  3-.  '  for 

n  n*  n  n 

s  -  t  •  2- ,  so  that  Vn  •  3,  '  and  ; n ( 1 )  -  :n( 3)  ■  : ( 1 )  -  ; (0)  *  2-  ' 

•  2- ; (1 )  -  : (0)  for  small  t ' .  Thus  for  n  €  B„  , , 

n  nt 

P[w"(Yn.*)  _>  f|nn]  14K,'Z>[;(1)  -  ttOnOc')2®"1. 

Choosing  t*  so  that  the  RHS  of  the  last  inequality  is  •  j/2,  we  now  have 

P[w"( Yn.«)  >  c]  !  4Kt *Zy[ * { 1 )  -  «(0)](3c  ' )Z*-1  ♦  P[B‘c.l  <  , }, 


and  the  thoerem  is  proved. 


Proof  of  Theorem  4.  By  Condition  A,  it  suffices  to  show  that  (6) 

holds.  We  first  sho w  the  convergence  of  ffd.  Fix  a  positive  integer  r, 

r 

0  -  h  -  •"  -  'r  -  ll . V  rMl  and  "rUs  r-m  * 

Then  ....  r.nn  are  conditionally  independent  given  with 


*nn 


E( 


ni  n 


)  -  0. 


'ni  1 


are 


bounded  by  r^^lh!!  -  (r  )'  and 


.-1/2" 


|  J  n  j 
we  have 


(t.)  =  n"  ';nr  Using  Cov[B*(s) ,B*(t)]  =  s(l  -  t),  0  -  S  L  t  •  1, 


2  -  Var[7.  .B*(t.)]  »  >  x?t.(l  -  t.)  ♦  2  >  ,  v , ,  t .  (1  -  t  • ,  ) , 

T  J  J  j  =  l  J  J  J  l^j<j  ‘  <r  J  J  J  J 

2  C  _1  n 

and  n  1  Var[- j Yn ( t j )  1  r»n 3  =  n  £  VarU^.;?^]  is  obtained  by  substituting 


-1  0 


'  -  V>  ‘  inl(tV’n) 


for  t  (1  -  t - . )  in  the  formula  for  i  .  Since  7 *  B* ( t . )  is  N(0,o  ), 
J  J  T  J  J 


we  only  need  to  show  that 


(8) 


lim  P[n"l,?Jcf|1  i  oy] 


=  $(y)  for  all  y. 


n  — 


where  »  is  the  cdf  of  N(0,1).  For  arbitrary 


t  >  0,  let 
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An,  "  lnn:n’1/2J11'ni(tj*nn)l  -fJ  -  j  -  rK  For  nn€V*  !  n7  ^  '  1 

2  2 

^  Kj<  with  =  (6r  -  4)|ia|;  /r-  .  Hence  for  small  c,  n/.:  lies  between 
1  +  ,  so  that 


|'>(jy/on)  -  t>(y ) |  <  My  +  K1  ]y | e )  -  My  -  k,  !ylc). 


On  the  other  hand,  for  n  6  A  with  small  c,  we  use  the  bound  on  :  . 

n  ne  m 

to  obtain  the  Berry- Esseen  bound 


|P[n  1  / i  oy|nn]  -  Moy/o  )| 


<  n‘1/2C{K2/(l  -  K^)}372  <  2n‘1/2CK2 


with  K2  =  rj  a  j;  a  and  a  universal  constant  C.  Since  c  >  0  is  arbitrary, 
(8)  now  follows  from  (9),  (10)  and  the  fact  that  lim  P[An^]  -  1  by  Condition 

n-~» 

A.  To  complete  the  proof  of  Theorem  4,  we  only  need  to  verify  that 
satisfies  the  conditions  of  Theorem  5  with  >  =  2,  a  =  1,  ;n(t) 


=  n’  £G  ,(tjn  )  and  Mt)  =  t.  The  conditional  moment  inequality  (7)  is 

i  iii  n 

-in 

established  by  easy  but  tedious  algebra  which  we  omit  and  n  FG  Mt  n  ) 

v  n  l  n 

-ln 

-  t  =  n  VVMt.n  )  uniformly  converges  to  0  in  probability  by  Condition  A. 
¥  ni  n 


17 


References 

[1]  Bhattacharya,  P.  K. ,  fang,  S.  S.  and  Chernoff,  Herman.  (1930).  lion- 
parametric  estimate  of  the  slope  of  a  truncated  regression.  Tech. 
Rep.  No.  18,  Massachusetts  Institute  of  Technology. 

[2]  Bhattacharya,  P.  K.  and  Frierson,  Dargan,  Jr.  (1981).  A  nonpara- 
metric  control -chart  for  detecting  small  disorders.  Ann.  Statist, 
to  appear  in  the  July  issue. 

[3]  Billingsley,  Patrick  (1968).  Convergence  of  Probability  Measures. 
Wiley,  New  York. 

[4]  Hoeffding,  Wassily  (1963).  Probability  inequalities  for  suns  of 
bounded  random  variables.  J.  Amer.  Statist.  Assoc.  58,  13-30. 

[5]  Parent,  E.  A.,  Jr.  (1965).  Sequential  ranking  procedures.  Doctoral 
Dissertation,  Stanford  University. 

[6]  Sen,  P.  K.  (1968).  Estimates  of  the  regression  coefficient  based  on 
Kendall's  tau.  J.  Amer.  Statist.  Assoc.  63^,  1379-1389. 

[7]  Theil,  H.  (1950).  A  rank  invariant  method  of  linear  and  polynomial 
regression  analysis,  I,  II  and  III,  Nederl .  Akad.  Wetensch.  Proc. 

53,  386-392,  521-525,  1397-1412. 

[8]  Turner,  Edwin  L.  (1979).  Statistics  of  the  Hubble  diagram.  I. 
Determination  of  qg  and  luminosity  evolution  with  application 

to  quasars.  Astrophys.  Jour.  230,  291-303. 


E 


SECURITY  Clarification  or  THIS  a  Ar.r  rm,«i  Data  Emararf) 


REPORT  DOCUMENTATION  PAGE 


.  REPORT  NUMBER 


«.  TITLE  land  JuAllll.J 


Nonparametric  Test  for  the  Slope  of  a 
Truncated  Regression 


READ  INSTRUCTIONS 
BEFORE  COMPLETING  FORM 


J  REC.IPI«jJi’S  catalog  NUMBER 


S.  Type  or  REPORT  *  PERIOO  COVEReO 


Technical  Report 


••  PERFORMING  ORG.  REPORT  NUMBER 


7.  AU  THORf 


P,  K.  Bhattacharya,  University  of  Arizona 


•.  CONTRACT  OR  GRANT  NUMflCR(«; 

N00014-75-C-0555 


».  PERFORMING  ORGANIZATION  name  ANO  ADORES! 

Department  of  Mathematics 
Massachusetts  Institute  of  Technology 
Cambridge,  Massachusetts  02139 


n.  CONTROLLING  OFFICE  NAME  AND  AOORESS 


12.  REPORT  OATS 


January  12,  1981 


is.  number  or  pages 


i 


Office  of  Naval  Research 

Statistics  &  Probability  Program  Code  436 

Arlington,  Virginia  "2221  7  _ 


MONITORING  AGENCY  NAME  a  AOORESSf//  dl  Moroni  from  Controlling  Otttco)  IS.  SECURITY  CLASS,  (ot  its  to  ropon) 


Unclassi fied 


IS*.  OCCLASSirtCATION.  OG WN GRACING 
SCHEOULE 


DISTRIBUTION  STATEMENT  (al  UUa  Aapari; 


APPROVED  FOR  PUBLIC  RELEASE:  DISTRIBUTION  UNLIMITED. 


17.  DISTRIBUTION  STATEMENT  fot  tho  mbotrmct  on  fro*  in  Block  20.  II  dllloroni  from  Report) 


If  KEY  WOROS  f  Continue  on  roeoteo  «N«  ll  nocoooorr  end  Identity  by  block  nxmbor) 


Truncated  Regression,  Hubble  Diagram,  Testing  Homogeneity,  Under 
Progressive  Truncation,  Kolmogorov-Smi rnov  Test,  Sequential  Rank, 
Asymptotic  Distribution,  Brownian  Bridge. 


20  Apfl  RACT  (Continue  on  rmoeree  •  Ido  ll 


ooey  md  Identity  by  block  number) 


(see  reverse  side) 


DO  I  jl”|  1473  EOlTiDN  or  I  NOV  »s  ll  OBSOLETE 

s  k  a<ot-  '.r-  ji4-  tto i 


UNCLASSIFIED _ 

SECURITY  CLASSiriCATiON  or  TniS  PAGE  ,BRan  Data  EoiarM) 


UNCLASSIFIED _ 

»«CUWITY  CLAMiriCATIQH  Q»  Twit  CVhOT  Of 

20. 

Consider  the  hypothesis  Mq :  6  =  iSq  >  0  in  a  linear  regression 
model  where  the  cdf  of  Y  -  tfx  is  unkown  and  Y  is  subject  to  the 
truncation  Y  <  yg  .  Testing  Hq  on  the  basis  of  n  independent 
(x^,Y.j)  with  x.|  £  ...  .<  xp  is  equivalent  to  testing  the  underlying 
homogeneity  of  the  independent  =  Y^  -  BgX^  subject  to  progressive 
truncation  <_  w^  =  yg  -  &oxi  '  ^or  ana^y2ing  astronomical  observations 
a  test  has  been  proposed  in  the  literature,  which  computes  the  sequential 
ranks  Ri  of  among  "comparable"  Vj  ,  j  <  i  satisfying 
Vj  _<  w^  and  compares  the  empirical  cdf  Hn(t)  °f  ( 2R -  -  1)/2N^  with 
t  by  a  K  -  S  statistic.  Since  (2R^  -  1)/2N^  are  neither  independent 
nor  exactly  uniform  [0,1]  ,  the  applicability  of  the  usual  asymptotic 
null  distribution  of  the  K  -  S  statistic  in  this  context  needs 
justification  which  is  provided  in  this  paper  under  a  sufficient  con¬ 
dition  requiring  that,  the  rate  of  progressive  truncation  is  not  too 
severe. 


i  N  y  12*  •.*•  J  *  4«  30 0  i 

UNCLASSIFIED _ 

;.«s$ia-caPion  : •  ••  & 


t) 


